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Abstract 

An important tool in the study of the complexity of Constraint Satisfaction Prob- 
lems (CSPs) is the notion of a relational clone, which is the set of all relations express- 
ible using primitive positive formulas over a particular set of base relations. Post's 
lattice gives a complete classification of all Boolean relational clones, and this has been 
used to classify the computational difficulty of CSPs. Motivated by a desire to un- 
derstand the computational complexity of (weighted) counting CSPs, we develop an 
analogous notion of functional clones and study the landscape of these clones. One of 
these clones is the collection of log-supermodular (Ism) functions, which turns out to 
play a significant role in classifying counting CSPs. In the conservative case (where 
all nonnegative unary functions are available), we show that there are no functional 
clones lying strictly between the clone of Ism functions and the total clone (containing 
all functions). Thus, any counting CSP that contains a single nontrivial non-Ism func- 
tion is computationally as hard to approximate as any problem in ^^P. Furthermore, 
we show that any non-trivial functional clone (in a sense that will be made precise) 
contains the binary function "implies". As a consequence, in the conservative case, all 
non-trivial counting CSPs are as hard as #BIS, the problem of counting independent 
sets in a bipartite graph. Given the complexity-theoretic results, it is natural to ask 
whether the "implies" clone is equivalent to the clone of Ism functions. We use the 
Mobius transform and the Fourier transform to show that these clones coincide pre- 
cisely up to arity 3. It is an intriguing open question whether the Ism clone is finitely 
generated. Finally, we investigate functional clones in which only restricted classes of 
unary functions are available. 
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1 Introduction 

In the classical setting, a (non- uniform) constraint satisfaction problem CSP(r') is specified 
by a finite domain D and a constraint language which is a set of relations of varying 
arities over D. For example, D might be the Boolean domain {0, 1} and F might be the set 
containing the single relation NAND = {(0, 0), (0, 1), (1, 0)}. An instance of CSP(r') is a set 
of n variables taking values in together with a set of constraints on those variables. Each 
constraint is a relation R from F applied to a tuple of variables, which is called the "scope" 
of the constraint. The problem is to find an assignment of domain elements to the variables 
which satisfies all of the constraints. For example, the problem of finding an independent 
set in a graph can be represented as a CSP with F = {NAND}. The vertices of the graphs 
are the variables of the CSP instance. The instance has one NAND constraint for each edge 
of the graph. Vertices whose variables are mapped to domain element 1 are deemed to be in 
the independent set. Constraint satisfaction problems (CSPs) may be viewed as generalised 
satisfiability problems, among which usual satisfiability is a very special case. 

The notion of expressibility is key to understanding the complexity of CSPs. A primitive 
positive formula (pp-formula) in variables V = {vi, . . . ,Vn} is a formula of the form 



where each atomic formula is either a relation R from F applied to some of the variables 
in V = {vi, . . . ,Vn+m} or an equality relation of the form Vi = vj, which we write as 
EQ{vi,Vj). For example, the formula 



is a pp-formula in variables vi and V2- This formula corresponds to the relation {(0, 0)} since 
the only way to satisfy the constraints is to map both vi and V2 to the domain element 0. 

The relational clone {F)-^ is the set of all relations expressible as pp-formulas over F. 
Relational clones have played a key role in the development of the complexity of CSPs 
because of the following important fact, which is described, for example, in the expository 
chapter of Cohen and Jeavons [H]: If two sets of relations F and F' generate the same 
relational clone, then the computational complexities of the corresponding CSPs, CSP(r') 
and CSP(r"), are exactly the same. Thus, in order to understand the complexity of CSPs, 
one does not to consider all sets of relations F. It suffices to consider those that are relational 
clones. 

Recently, there has been considerable interest in the computational complexity of count- 
ing CSPs (see, for example [H |TOl [13], [191 l20l [35]). Here, the goal is to count the number 
of solutions of a CSP rather than merely to decide if a solution exists. In fact, in order 
to encompass the computation of partition functions of models from statistical physics and 
other generating functions, it is common (see, for example, [9]) to consider weighted sums, 
which can be expressed by replacing the relations in the constraint language by real-valued 
or complex-valued functions. In this case, the weight of an assignment (of domain values 
to the variables) is the product of the function values corresponding to that assignment, 
while the value of the CSP instance itself is the sum of the weights of all assignments. If 
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I is an instance of such a counting CSP then this weighted sum is called the "partition 
function of /" (by analogy with the concept in statistical physics) and is denoted Z{I). For 
a finite set of functions F we are interested in the problem ^CSP{r), which is the problem 
of computing Z{I), given an instance / which uses only functions from F. 

Our first goal (see Q is to determine the most useful analogues of pp-definability and 
relational clones in the context of (weighted) counting CSPs (#CSPs), and to see what 
insight this provides into the computational complexity of these problems. It is clear that, 
in order to adapt the concept of pp-definability to the counting setting, one should replace 
a conjunction of relations by a product of functions and replace existential quantification 
by summation. However, there are sensible alternatives for the detailed definitions, and 
these have ramifications for the complexity-theoretic consequences. There is at least one 
proposal in the literature for extending pp-definability to the algebraic/functional setting 
— that of Yamakami [33]. However, we find it useful to adopt a more liberal notion of 
pp-definability, including a limit operation. Without this, a functional clone could contain 
arbitrarily close approximations to a function F of interest, without including F itself. We 
call this analogue of pp-definabihty "pps^^-definability" . The notion of pps^-definability leads 
to a more inclusive functional clone than the one considered in |35j . 

Aside from a desire for tidiness, there is a good empirical motivation for introducing 
limits. Just as pp-definability is closely related to polynomial-time reductions between clas- 
sical CSPs, so is pps^-definability related to approximation-preserving reductions between 
(weighted) counting CSPs. Lemma 17 is a precise statement of this connection. Many 
approximation-preserving reductions in the literature (for example, those in |2I]) are based 
not on a fixed "gadget" but on sequences of increasingly-large gadgets that come arbitrar- 
ily close to some property without actually attaining it. Our notion of pps^^-definability is 
intended to capture this phenomenon. 

The second, more concrete goal of this paper (see ^|3]-^ is to explore the space of 
functional clones and to use what we learn about this space to classify the complexity of 
approximating t^CSPs. We restrict attention to the Boolean situation so the domain is {0, 1} 
and the allowed functions are of the form {0, l}'^ — )■ ]R-° for some integer k. We examine the 
landscape of functional clones for the case in which all nonnegative unary functions (weights) 
are available. This case is known as the conservative case. It is also studied in the context 
of decision and optimisation CSPs [6l EH] and in work related to counting CSPs such as Cai, 
Lu and Xia's work on classifying "Holant*" problems [llj. The conservative case is easier to 
to classify than the general case, so we are able to construct a useful map of the landscape 
of functional clones (see Theorem 16). Note that Yamakami [33] has considered an even 
more special case in which all unary weights (including negative weights) are available. In 
that case the landscape turns out to be less rich and more pessimistic — negative weights 
introduce cancellation, which tends to drive approximate counting CSPs in the direction of 
intractability. 

An issue that turns out to be important in the classification of conservative functional 
clones is log-supermodularity. Roughly, a function with Boolean domain is said to be log- 
supermodular if its logarithm is supermodular. (A formal definition appears later.) It is a 
non-trivial fact (Lemma [?]) that the set LSM of log-supermodular functions is a functional 
clone (using our notion of pps^-definability). 

Conservative functional clones are classified as follows. A particularly simple functional 
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clone is the clone generated by the disequality relation. A counting CSP derived from this 
clone is trivial to solve exactly, as the partition function factorises. We say that functions 
from this clone are of "product form". Our main result (Theorem 16) is that any clone that 
contains a function F that is not of product form necessarily contains the binary relation 
IMP = {(0,0), (0, 1), (1, 1)}. This has important complexity-theoretic consequences, which 
will be discussed presently. Furthermore, (also Theorem 16), any non-trivial clone that 
contains a function F that is not log-supermodular actually contains all functions. Therefore 
a large part of the functional clone landscape is very simple. In particular, we have a complete 
understanding of the clones below the clone generated by IMP and of the clones above LSM. 
The complexity of the landscape of functional clones is thus sandwiched between the clone 
generated by IMP and the clone LSM. 



In order to derive complexity-theoretic consequences (see Theorem 18), we also present 
an efficient version of pps^-definability, and a corresponding notion of functional clone. The 
complexity-theoretic consequences are the third contribution of the paper. In order to de- 
scribe these, we need a quick digression into the complexity of approximate counting. The 
complexity class ^RHIIi of counting problems was introduced by Dyer, Goldberg, Greenhill 
and Jerrum [20] as a means to classify a wide class of approximate counting problems that 
were previously of indeterminate computational complexity. The problems in ^RHIIi are 
those that can be expressed in terms of counting the number of models of a logical formula 
from a certain syntactically restricted class. The complexity class ^RHIIi has a complete- 
ness class (with respect to approximation-preserving "AP-reductions" ) which includes a wide 
and ever-increasing range of natural counting problems, including: independent sets in a bi- 
partite graph, downsets in a partial order, configurations in the Widom-Rowlinson model 
(all pQ]) and stable matchings pE]. Either all of these problems admit a Fully Polynomial 
Randomised Approximation Scheme (FPRAS), or none do. The latter is conjectured. A 
typical complete problem in this class is #BIS, the problem of counting independent sets in 
a bipartite graph. 



Our complexity-theoretic results are presented in 510 As noted above, #CSP(J-') is 



computationally easy if every function in J-" is of product form. We show that, in every other 
(conservative) case, it is as difficult to approximate as #BIS. If, in addition, J-" contains a 
function F which is not log-supermodular, then the counting problem #CSP(J-') turns out 
to be universal for Boolean counting CSPs and hence is provably NP-hard to approximate. 
As immediate corollaries, we recover existing results concerning the complexity of computing 
the partition function of the Ising model [21]. 

Given the above discussion, one might speculate that the IMP-clone and LSM are the 
same. In fact, they are not. In ^ 11 , we examine the classes (LSM^) generated by Ism functions 
of arity at most k. We show that (LSM3) is equal to the IMP-clone, but we give a proof 
that (LSM3) is strictly contained in (LSM4). This mirrors the situation for VCSPs, where 
binary submodular functions can express all ternary submodular functions but not all arity 
4 submodular functions [36j . However, we do not know whether there is a fixed k such that 
LSM = (LSMfc). We conjecture that this is not the case. If LSM = (LSMfc) for some fc, there 
would still remain the question of whether LSM is finitely generated, i.e., whether it is the 
functional clone generated by some finite set of functions J-". We conjecture the opposite, 
that there is no such J-". 



Finally, in ^12 and ^13, we step outside the conservative case, and study functional 
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clones in which only restricted classes of unary functions are available. As might be ex- 
pected, this yields a richer structure of functional clones, including one that corresponds 
to the ferromagnetic Ising model with a consistent field, a problem that is tractable in the 
FPRAS sense [25J. We also exhibit in this setting two clones that are provably incompa- 
rable with respect to inclusion, even though their corresponding counting CSPs are related 
by approximation-preserving reducibility. These counting CSPs have natural interpretations 
as (a) evaluating the weight enumerator of a binary code, and (b) counting independent 
sets in a bipartite graph. This example shows that, in demonstrating intractability, it may 
sometimes be necessary to use reductions that go beyond pps^-definability. 

Although we focus on approximation of the partition functions of (weighted) #CSPs in 
this paper, there is, of course, an extensive literature on exact evaluation. See, for example, 
the recent survey of Chen [T3] . 

2 Functional clones 

As usual, we denote the natural numbers by N, the real numbers by M and the complex 
numbers by C. For n G N, we denote the set {1,2, ... ,n} by [n]. 

Let (^, +, x) be any subsemiring of (C, +, x), and let D be a finite domain. For n G N, 
denote by Un the set of all functions — )■ also denote by W = U Wi U W2 U ■ ■ ■ 
the set of functions of all arities. Note that we do not specify the domain, which we take 
to be understood from the context, in this notation. Suppose J-" C W is some collection of 
functions, V = {vi, . . . , Vn} is a set of variables and x : {vi, . . . , f„} — )■ D is an assignment to 
those variables. An atomic formula has the form ip = G{vi^, . . . , Vi^) where G E J^, a = a{G) 
is the arity of G, and {vij^,Vi^,. . . ,ViJ E V is a scope. Note that repeated variables are 
allowed. The function : — )■ ^ represented by the atomic formula ip = G{vi^, . . . ,ViJ 
is just 

F^{x) =G{x{viJ,...,x{viJ) = G{xi„...,XiJ, 

where from now on we write xj = x{vj). 

A pps-formula ( "primitive product summation formula" ) is a summatiorj^ of a product 
of atomic formulas. A pps-formula x(j in variables V O V' = {vi, . . . , Vn+m} over has the 
form 

s 

where cpj are all atomic formulas over J-" in the variables V. (The variables V are free, and 
the others, V \ V, are bound.) The formula ijj specifies a function : — ^ in the 
following way: 

s 

(2) F^{x)= J2 n^^.(^'2/)' 

yeD'" j = l 

^To avoid ambiguity, we will try to use "summation" of functions only with the meaning given here. Sums 
of different functions will be referred to as "addition" . 
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where x and y are assignments V ^ D, V \ V ^ D . The functional clone (J-") generated 
by J-" is the set of all functions in U that can be represented by a pps-formula over J-'U {EQ} 
where EQ is the binary equality function defined by EQ(a;,x) = 1 and EQ(a;,?/) = for 
X ^ y. We refer to the pps-formula as an implementation of the function. 

Since pps-formulas are defined using sums of products (with just one level of each), we 
need to check that functions that are pps-definable in terms of functions that are themselves 
pps-definable over J-" are actually directly pps-definable over J-". The following lemma ensures 
that this is the case. 

Lemma 1. If G E {T) then {T,G) = (J^). 

Note that, to simplify notation, we write (J-", G) in place of the more correct (J-" U {G}). 
More generally, we shall often drop set-brackets, replace the union symbol U by a comma, 
and confuse a singleton set with the element it contains. 

Proof of Lemma\^ Let J^' = U {EQ}. Suppose that is a pps-formula over F' U {G} 
given by 

(3) ^= E iiiAiii'i'n^ 

t)„ + l,...,D„ + m \i = l / \i=l 

where {^i} are atomic ^-''-formulas and {ipj} are atomic G-formulas in the variables V . 
Then 

(4) F^{x)= J2 [f[F^X^,y)] (f[F^^{x,y) 

y£D"^ \i=l J \j=l 

where x and y are assignments x : {t>i, . . . , t>„} — )■ D and y : {fn+i, • • • , Vn+m} D- Now, 
since G is pps-definable over J-"', and each tpj is an atomic G-formula in the variables V, we 
can write each ipj as 

t 

v^.+i,...,v^.+l k=l 

where i is the number of bound variables used in the definition of ipj {£ is independent of j), 
Uj = n + m + {j — 1)£ is the number of free variables plus the number of bound variables 
that are "used up" by ipi, . . . , ipj-i, and each ipj^k is an atomic J^'-formula over the variables 

V U {vu^+i, Vu^+e}- We get 

2/e_D'" \i=l / \j=l zieD* k=l 

= E E ■■■ E (UF.A-,y)] (flllF,^A-,y,z^ 

yeD"^ z^eD^ z=eD<^ \i=l 1 \j=lk=l 
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where each is an assignment : {f^j+i, . . . , v^^j^e} — > D. So 



Vn + l,---,'",^s+i \« = 1 / \j = l fc = l 



S t 

k 



is a pps-formula over J-"' for the function F^. □ 

To extend the notion of definabihty, we allow limits as follows. We say that an a-ary 
function F G W is pps^^-definable over if there exists a finite subset Sp of J-", such that, 
for every £ > 0, there exists an a-ary function F, pps-definable over Sp, such that 

||F - F||oo = max \F{x) - F{x)\ < e. 

Denote the set of functions that are pps^^-definable over J^U {EQ} by {J^)u}] we call this 
the pps^-definable functional done generated by J^. Observe that functions in {J^)ui are 
determined only by finite subsets of J-". Also, some functions taking values outside ^ may 
be the limit of functions pps-definable over J-". But they are not pps^-definable, since the 
function values of the limit must be in The domain ^ of the universal class of functions U 
in operation at any time will be clear from the context. 

The following lemma is an analogue of Lemma [TJ 

Lemma 2. If G e {J^)u) then {T,G)u, = (-F)^. 

Proof. Let J^' = U {EQ}. Suppose that H is an a-ary function in (J^, G)^. Let Sh be a 
finite subset of J-"' U {G} such that the following is true: Given e > 0, there exists an a-ary 
function H, pps-definable over Sh, such that \\H — H\\oo < sjl. Let be a pps-formula 
over Su representing H. For any function G with the same arity as G, denote by '?/'[G:=G] 
the formula obtained from i\} by replacing all occurrences of G by G. By continuity of the 
operators of pps- formulas, we know there exists 5 > such that, for every function G of the 
same arity as G,\G — G\oo < ^ implies 

This claim will be explicitly quantified in the proof of Lemma |4| but we don't need so much 
detail here. Of course, H = F^ so for each such G we have \\F_^^^.^qj — H\\oo < e/2. Now 
let Sg be the finite subset of J-"' used to show that G is pps^-definable over J^'. Let S = 
Sg U 5*^^ \ {G} C J^'. Choose a function G, pps-definable over Sg, satisfying HG — G||oo < 
Notice that G G (S) and F^ G {S, G) so F'^[G:=G] ^ (by Lemma and 

Since £ > is arbitrary, and 5 C J^' is finite, we conclude that H G {J^)uj- □ 

That completes the setup for expressibility. In order to deduce complexity results, we 
need an efficient version of {J^)u}- We say that a function F is efficiently pps^-definable over 
if there is a finite subset 5*^ of J-", and a TM Ai f,Sf with the following property: on input 
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e > 0, Ai f,Sf computes a pps-formula ^p over Sp such that has the same arity as F and 
\\F^ — F\\oo < e. The running time of Aip.Sp is at most a polynomial in \oge~^. Denote the 
set of functions in U that are efficiently pps^-definable over J^U {EQ} by {J^)ui,p', we call this 
the efficient pps^- definable functional clone generated by J-", 

The following useful observation is immediate from the definition of {J-')uj,p- 

Observation 3. Suppose F G {J^)uj,p (or F E (-7-")^). Then there is a finite subset Sp of J-" 
such that F G {Sf)uj,p (resp. F G {Sf)uj)- 

The following lemma is an analogue of Lemma [2] . 
Lemma 4. If G e {J^)^,p then {J^,G)^.p = {J^)uj,p- 

Proof. Let J-"' = J-" U {EQ}. Suppose H is an a-ary function in {J^',G)^^p. Our goal is to 
specify a finite subset S of J-"' and to construct a TM Mh,s with the following property: on 
input £ > 0, Mh,s should compute an a-ary pps-formula ip over 5" such that \\F^ — H\\^ < e. 
The running time of Mh,s should be at most a polynomial in log6~^. 

Let Sh be the finite subset of U {G} from the efficient pps^^-definition of H over 
J-"' U {G}. Given an input e/2, the TM Mh,Sh computes an a-ary pps-formula ip over Sh 
such that \\F^ — H\\oo < sj2. Write -0 as in Equation ([3]) so F^ is written as in Equation Q. 
Suppose that, for j G [s] and y G {0, 1}™, 5j,y(a5) is a function of cc. Consider the expression 

T(a.)= 5^ (nF^,(x,?/)) (n(F^^.(a.,?/) + 5,-,(a.)) 
\i=i / \j=i 



which can be expanded as 



j/eD™ 0CTC[s] iST 



where 



i=l j(.[s]\T 

Let C = max^,y,T \Cy,T{x)\ and let 5 = e2-'''+^^\D\-"'G-'^ < 1. 

Now let 5*^ be the finite subset of J-"' used to show that G is efficiently pps^^-definable 
over J^'. Given the input 6, the TM Mg,Sg computes a pps-formula ■?/' over Sq representing a 
function with the same arity as G such that — GHoo < 5- Since each ipj is an atomic 

G-formula in the variables V , we may appropriately name the variables of ip to obtain a 
pps-formula ifjj over S'c such that — F^^W^o < S. 

For y G il''", let 6j,y{x) = F^ {x,y) — F^^{x,y) and note that \5j,y{x)\ < 5. Let 
S = Sg^ Sh \ {G} C J^'. Let ip' be the formula over S formed from by substituting each 
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occurrence of ipj with ipj. Let (p be the pps- formula over 5* for the function F^' which is 
constructed as in the proof of Lemma [T] From the calculation above, 

1 1 -^ip -^Ip 1 1 OO II 1 1 

= max \F^'{x) — F^{x)\ 

X 

= max |T(a3)| 

X 

< 2'\DrCS = e/2. 

Note that 

1 1 -^tp 1 1 OO — II -^ip 1 1 OO II -^^p 1 1 OO ^ ^ • 

Thus, the formula ip is an appropriate output for our TM Mh,s- 

Finally, let us check how long the computation takes. The running time of Mh.Sh 
most poly(log£~^). Since this machine outputs the formula ip, we conclude that m and r and 
s are bounded from above by polynomials in loge~^. Let A be the ceiling of the maximum 
absolute value of any function in Sh- Note that C < A^^^. The running time of Mq^Sg 
at most poly(log(5~^)), which is at most a polynomial in m + s + log(C) + log(£~^) which is 
at most a polynomial in log(£:^^). Finally, the direct manipulation of the formulas that we 
did (renaming variables from ip to obtain tpj and producing the pps-formula ip from ip and 
the ipj formulas) takes time at most polynomial in the size of ip and ip, which is at most a 
polynomial in log(e~^). □ 

Lemma |4] may have wider applications in the study of approximate counting problems. 
Often, approximation-preserving reductions between counting problems are complicated to 
describe and difficult to analyse, owing to the need to track error estimates. Lemma |4] 
suggests breaking the reduction into smaller steps, and analysing each of them independently. 
This assumes, of course, that the reductions are pps^-definable, but that often seems to be 
the case in practice. 



3 Relational clones and nonnegative functions 

A function F G W is iJoo/ear?]^ if its range is a subset of {0, 1}. Then F encodes a relation R 
as follows: x is in the relation R iff F{x) = 1. We will not distinguish between relations and 
the Boolean functions that define them. Suppose that 7?. C W is a set of relations/Boolean 
functions. A pp-formula over TZ is an existentially quantified product of atomic formulas (this 
is called an 3CNF(7?.)-formula in [16]). More precisely, a pp-formula ip over TZ in variables 
V = {f 1, . . . , Vn+m} has the form 

s 

Ip = 3Vn+l, . . . ,Vn+m /\ 'Pj, 

i=i 

•^Note that "Boolean" applies to the codomain here, not the domain. As noted earher, all of the functions 
that we consider have Boolean domains. This usage of "Boolean function" is unfortunate, but is well 
established in the literature. When the range is not a subset of {0, 1} we emphasise this fact by referring to 
the function as a "pseudo-Boolean" function. 
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where cpj are all atomic formulas over TZ in the variables V. As before, the variables V = 
{vi, . . . ,Vn} are called "free", and the others, V \ V, are called "bound". The formula ip 
specifies a Boolean function : — > {0, 1} in the following way. R^{x) = 1 if there is a 
vector y G -D"^ such that /\!j^iRipj{x,y) evaluates to "1", where x and y are assignments 
X : {f 1, . . . , Vn} — )■ D and y : {f„+i, . . . , fn+m} — ^ D; R^{x) = otherwise. We call the 
pp-formula an implementation of R^. 

A relational clone (often called a "co-clone") is a set of relations containing the equality 
relation and closed under finite Cartesian products, projections, and identification of vari- 
ables. A basis [16] for the relational clone / is a set TZ of Boolean relations such that the 
relations in / are exactly the relations that can be implemented with a pp-formula over TZ. 
Every relational clone has such a basis. 

For every set TZ of Boolean relations, let (7^)r denote the set of relations that can be 
represented by a pp-formula over TZ U {EQ}. It is well-known that if R E (7^)r then 
(T^U {-R})r = {TZ)^ (This can be proved similarly to the proof of Lemma [l}) Thus, (7^)r is 
in fact a relational clone with basis TZ. 

A basis TZ for a relational clone (7^)r is called a "plain basis" fl6[ Definition 1] if every 
member of (7^)r is definable by a CNF(7^)-formula (a pp-formula over TZ with no 3). 

Pseudo-Boolean functions [S] are defined on the Boolean domain D = {0, 1}, and have 
codomain = M, the real numbers. For n G N, denote by ^„ the set of all functions 
{0, 1}" M, and denote the set of functions of all arities by =^ = U U U ■ ■ ■ . Note 
that any tuple x G {0, 1}" is the indicator function of a subset of [n]. We write \x\ for the 
cardinality of this set, i.e. \x\ = Yl^j=i^i- 

For most of this paper, we restrict attention to the codomain = M-° of nonnegative 
real numbers. Then Bn is the set given by replacing M by ]R-° in the definition of and 
then B is defined analogously to We will also need to consider the permissive functions 
in B. These are functions which are positive everywhere, so the codomain ^ = ]R^°, the 
positive real numbers. Thus B^^ and B^^ are given by replacing M by ]R^° in the definitions 
of and ^. 

The advantage of working with the Boolean domain is (i) it has a well-developed theory 
of relational clones, and (ii) the concept of a log-supermodular function exists (see §4). As 
explained in the introduction, the advantage of working with nonnegative real numbers is that 
we disallow cancellation, and potentially obtain a more nuanced expressibility/complexity 
landscape. 

Given a function F E B, let Rp be the function corresponding to the relation under- 
lying F. That is, Rpix) = if F{x) = and Rpix) = 1 if F{x) > 0. The following 
straightforward lemma will be useful. 

Lemma 5. Suppose J-" ^ B. Then 

{{Rf I F G ^})r = {RF\Fe (T)}. 

Proof. Let be a subset of B. First, we must show that, for any R G {{Rf \ F G J^})r, R 
is in {Rf\F e 

Let ip be the pp-formula over {Rf \ F G J-"} U {EQ} that is used to represent R. Write 
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^Ij as 

s 

i=i 

where Fj is an arity-a^ function in J-" U {EQ}, and the index function «(■,■) picks out an 
index in the range [1, n + m], and hence a variable from V = {vi, . . . , Vn+m}- Let ip' be the 
pps-formula over J-" U {EQ} given by 

s 

Let F' = F^/. Note that F' E (J") and that Rp' = R. 

By reversing this construction, we can show that, for any R G {Rp \ F G (J^)}, R is in 
{{Rf I F G ^})r. □ 



4 Log-supermodular functions 

A function F G i3„ is log-supermodular (Ism) if F{x V y)F{x A y) > F{x)F{y) for all 
x,y E {0, 1}". The terminology is justified by the observation that F G B^^ is Ism if and 
only if / = logF is supermodular. We denote by LSM C i3 the class of all Ism functions. The 



second part of our main result (Theorem 16) says that, in terms of expressivity, everything 



of interest takes place within the class LSM. Consequently, in ^11, we will investigate the 
internal structure of LSM. 

Note that Bq,Bi C LSM, since log-supermodularity is trivial for nuUary or unary func- 
tions, and hence the class LSM is conservative. And it fits naturally into our study of 
expressibility because of the following closure property: functions that are pps^-definable 
from Ism functions are Ism. The non-trivial step in showing this is encapsulated in the 
following lemma. It is a special case of the Ahlswede-Daykin "four functions" theorem [I]. 
However [1] is a much stronger result than is required, so we give an easier proof, using an 
argument similar to the base case of the induction in [1]. 

Lemma 6. If G E Bn+m, let G' E Bn be defined by G'{x) = Xlze{o i}™ ^(*' '^)- Then 
G E LSM implies G' E LSM. 

Proof. By symmetry and induction, it suffices to consider summation on the last variable. 
Thus, let {x,Xn+i), {y,yn+i) e {0, l}"+\ and let 

a, = G{x, z), (3, = G{y, z), = G{x Vy,z), 6, = G{x A y, z) {z E {0, 1}). 

Then we must show that G G LSM implies 

(ao + ai)(/3o + /3i) < (7o + 7i)(<5o + 

which expands to 



(5) 



aoPo + aiPo + aoPi + < 70^0 + 70^1 + 7i'^o + 7i^i- 
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Since G G LSM, we have the foUowing four inequahties, 

(6) ao(3o < 70^0, (7) «o/3i < 71*^0, (8) ai/3o < 7i^o, (9) aiA < 7i'^i- 
We will complete the proof by showing that (|6]) to (|9]) imply (|5| for arbitrary nonnegative 

real numbers a^, f3z, 7^, Sz {z G {0, 1}). 

Using (|6| and ([9]), it follows that ([s]) is implied by 

(10) ai/3o + ao/3i < 70^1 + 7i'^o- 

Observe that 71^0 = implies (10), since the left side is zero by ^ and ([s]). Thus we may 
assume 71 (5o > 0. 

Now, using (@ and (g again, ( [lO| is imphed by 

(TU ^ R R <r («o/3o)(«i/3i) , X («i/^o)(«o/^i) , X 

(11) ai/5o + aoPi < ? h7i'Jo = ? h 7i'Jo- 

c>o7i 7i'Jo 



Now (11) can be rewritten as 

< (7i'^o - tti/3o)(7i^o - "o/3i), 
which is implied by ([T]) and (jsj). □ 
Lemma 7. // C LSM then (J^)^ C LSM. 

Proof. We just need to show that each level in the definition of pps^^-definable function 
preserves Ism: first that every atomic formula over J-" U {EQ} defines an Ism function, then 
that a product of Ism functions is Ism, then that a summation of an Ism function is Ism, 
and finally that a limit of Ism functions is Ism. As we shall see below, only the third step is 
non-trivial, and it is covered by Lemma |6j 

First, note that the EQ is Ism, so every function in J-'U {EQ} is Ism, An atomic formula 
if = G{vi-^, . . . , ViJ defines a function F^{x) = G{xi^, . . . ,Xi^) which is Ism: 

F^{xV y)F^{x Ay) = G{xi^ Vyi^,...,Xi^ VyiJG{xi^ Ayi^,...,x^^ AyiJ 
> G{xi^,...,XiJG{yi,,...,yiJ 
= F^{x)F^{y). 

Note that we do not need to assume that ii, . . . ,ia are all distinct. 

It is immediate that the product of two Ism functions (and hence the product of any 
number) is Ism. Thus the product 11^=1 appearing in ^ is Ism. Then, by Lemma [6| the 
pps-definable function F^ in ^ is Ism. 

Finally, we will show that any function that is approximated by Ism functions is Ism. 
Suppose that a function F G S„ has the property that, for every e > 0, there is an arity-n 
Ism function F satisfying 

||F-F||oo= max \F(x) - F(x)\ < e. 

a;e{0,l}° 

We wish to show that F is Ism. Let -Fmax = max3,F(a;). Suppose for contradiction that F 
is not Ism, so there is a 5 > and x,y & {0, 1}" such that 

F{x V y)F{x Ay) < F{x)F{y) - 6. 
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Let e > be sufficiently small that e max(Fmax, 1) is tiny compared to 5. Then 

F{x V y)F{x Ay) < {F{x y y) + e){F{x Ay)+e) 

< F{x V y)F{x Ay) + 2eF^,, + 

< F{x)F{y) - 5 + 2eF^,^ + 

< {F{x) + e){F{y) + e) - 5 + 2eF^,^ + 

< F{x)F{y) + 2£(F^ax + s) - 5 + 2eF^,^ + 2e^ 

< F{x)F{y), 

so F is not Ism, giving a contradiction. □ 
An important example of an Ism function is the 0,1-function "implies". 



IMP(x,y) 



0, if (x,y) = (l,0); 

1, otherwise. 



We also think of this as a binary relation IMP = {(0, 0), (0, 1), (1, 1)}. Complexity-theoretic 
issues will be treated in detail in ^|TOj However, it may be helpful to give a pointer here to 
the importance of IMP in the study of approximate counting problems. 

The problem #BIS is that of counting independent sets in a bipartite graph. Dyer et 
al. [18] exhibited a class of counting problems, including #BIS, which are interreducible via 
approximation-preserving reductions. Further natural problems have been shown to lie in 
this class, providing compelling evidence that is of intermediate complexity between counting 
problems that are tractable (admit a polynomial-time approximation algorithm) and those 



that are NP-hard to approximate. We will see in due course (Theorem 18 and Proposition 36) 
that #BIS and #CSP(IMP) are interreducible via approximation-preserving reductions, and 
hence are of equivalent difficulty. 

We know from Lemma IT] that (IMP, C LSM, and one might ask whether this inclu- 



sion is strict. We will address this question in ^11 



5 Pinnings and modular functions 

Let 5q be the unary function with 5o(0) = 1 and 5o(l) = and let 5i be the unary function 
with 5i(0) = and 5i{l) = 1. 

Let S C [n], let x' = {xj)jfs, and x" = {xj)j^Sy and partition x G {0,1}" as {x';x"). 
Then, if F G the function F{x']c) given by setting = Cj for constants Cj G {0,1} 
(j G S*) is a pinning of F. Note that we allow the empty pinning S* = 0, which is F itself, 
and the pinning of all variables S = [n], which is a nullary function. 

Clearly, every pinning of F is in {F, Sq, Si), since a constant c G {0, 1} can be implemented 
using either Sq or Si, i.e. we add Sc{xi) to the constraint set. We will use the notation i c 
to indicate that the ith variable has been pinned to c. 

If n > 2 then a 2-pinning of a function F G i3„ is a function F{xi,Xj; c) which pins all 
but 2 of the variables. Thus [n] \ = where i and j are distinct indices in [n], and 

c G {0, 1}"~^. Clearly, every 2-pinning of F is in {F, B\), since it is a pinning of F. 
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We say that a function F G i3„ is log-modular if F{x V y)F{x Ay) = F{x)F{y) for all 
a;, 7/ G {0,1}". 

It is a fact that LSM and the class of log-modular functions are closed under pinning. This 
is a consequence of the following lemma about 2-pinnings of Ism and log-modular functions. 
It is due, in essence, to Topkis [31j, but we provide a short proof for completeness. 

Lemma 8 (Topkis). A function F G B^^ is Ism iff every 2-pinning is Ism, and is log-modular 
iff every 2-pinning is log-modular. 

Proof. The necessity of the 2-pinning condition is obvious, but we must prove sufficiency. 
We need only show F{x)F{y) < F{x V y)F{x A y) for x,y such that Xi ^ yi [i & [n]). 
All other cases follow from this. Note that log-supermodularity is preserved under arbitrary 
permutation of variables. Thus, if O'', 1*" denote r-tuples of O's and I's respectively, we must 
show that, for all r, s > with r + s = n, 

(12) F(0^ r)F{l\ 0") < F(r , r)F(0^ 0"). 

We will prove this by induction, assuming it is true for all r',s' > such that r' + s' < n. 
The base case, r' = s' = 1, is the 2-pinning assumption. If r > 1, then we have 

(13) F(0", l^)F(r-\ 0"+^) < F(r-\ 0, V)F{fd\ O'^), 



by induction, after pinning the rth position to 0, 

(14) F(r"\ 0, i^)F(r , 0") < F(r , r)F(r~\ o"+^) 

by induction, after pinning the first r — 1 > positions to 1. 



Now, multiplying (13) and (14) gives (12) after cancellation, which is valid since F is 



permissive. If r = 1, we do not have the induction giving ( |14[ ), so we use instead 
(15) F(l, 0')F(0, 1, 0"-^) < F(0, l')F(l, 1, 0^"^), 



by induction, using the base case after pinning the last s — 1 positions to 0, 



(16) 



F(l, 1, 0^~^)F(0, 1") < F(l, r)F(0, 1, 0' 



by induction, after pinning the second position to 1. 



Now, multiplying (15) and (16) gives (12) after cancellation, completing the proof for 



log-supermodularity. The proof for log-modularity is identical, except that every "<" must 
be replaced by "=" . □ 

Remark 9. We have proved Lemma |8] only for permissive functions because, in fact, it is 
false more generally. Consider, for example, the function F & B4 such that F(l, 1,0,0) = 
F(0, 0,1,1) = 1, F{x) = otherwise. It is easy to see that all the 2-pinnings F' of F 
have F'{x,y) > for at most one {x,y) G {0,1}^. It follows that every 2-pinning of F is 
log-modular. But F is not even Ism, since F(l, 1, 0, 0)F(0, 0, 1, 1) > F(l, 1, 1, 1)F(0, 0, 0, 0). 
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6 Computable real numbers 

Since we want to be able to derive computational results, we will now focus attention on 
functions whose co-domain is restricted to efficiently-computable real numbers. We will 
say that a real number is polynomial-time computable if the n most significant bits of its 
binary expansion can be computed in time polynomial in n. This is essentially the definition 
given in [27]. Let W denote the set of nonnegative real numbers that are polynomial-time 
computable. For n G N, denote by B"^ the set of all functions {0, l}" — )■ also denote by 
= Bq U U B2 U ■ ■ ■ the set of functions of all arities. 

Remark 10. As we have defined them, it is known that the polynomial-time computable 
numbers form a field [27], and hence a subsemiring ^ of C, as we require. Thus, in our 
definitions, there is no difficulty with pps-definability. However, there could be a problem 
with pps^-definability, since the limit of a sequence of polynomial-time computable reals 
may not be polynomial-time computable. Polynomial-time computability is ensured only by 
placing restrictions on speed of convergence. See [271 132] for details. However, observe that 
our definition of efficient pps^- definability avoids this difficulty entirely, by insisting that 
the limit of a sequence of reals will be permitted only if the limit is itself polynomial-time 
computable. 

Remark 11. The polynomial-time computable real numbers are a proper subclass of the 
efficiently approximable real numbers, defined in [22]. (This fact can be deduced from [27].) 
We have made this restriction since it results in a more uniform treatment of limits when 
we discuss efficient pps^-definability for functions in B^. 



7 Binary functions 

We begin the study of the conservative case in the simplest nontrivial situation. We consider 
the functional clones {F, B\)^^p, where F is a single binary function. 

Recall that EQ is the binary relation EQ = {(0, 0), (1, 1)}. (We used the name "EQ" to 
denote the equivalent binary function, but it will do no harm to use the same symbol for 
the relation and the function.) Denote by OR, NEQ, and NAND the binary relations OR = 
{(0,1), (1,0), (1,1)}, NEQ = {(0,1), (1,0)}, and NAND = {(0, 0), (0, 1), (1, 0)}. When we 
write a function F G B2, we will identify the arguments by writing F{xi,X2)- We may 
represent F by a 2 x 2 matrix 



M{F) 



'F(0,0) 


F(0,1)" 




/oo 


/oi 


F(1,0) 






/lO 


/ll 



say, with rows indexed by Xi G {0, 1} and columns by X2 G {0, 1}. We will assume /oi > 
/lo, since otherwise we may consider the function F^, such that F^(xi,X2) = F(x2,xi), 
represented by the matrix M{F)^. Clearly {F'^) = (F). 

If ?7 is a unary function, we will write U = (t/(0), U{1)) = {uo,ui), say. Then we have 



M{Uix,)Fix^,X2)) 



Mo/oO 
Ml/10 



Mo/01 
Ulfll 



M{U{X2)F{X^,X2)) 



Mo /go 

Mo/10 



Ml/01 
Ml/11 
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where both U{xi)F{xi,X2) and U{x2)F{xi,X2) are clearly in {F,U). 

If Fi, F2 e B2, then M{Fi)M{F2) = M{F), where F e (Fi, F2) is such that 

1 

F{xi,X2) = ^Fi(xi,y)F2(y,X2). 

y=o 

Lemma 12. Let F e Bf . Assuming /oi > /lO; 

«//oo/ii = /oi/io, ^/ien = (B?),,^; 

/oi, /lo = ano? /oo, /n > 0, ^/len (F, Bf)^,,, = {^i)cj,p; 

(in) if /oo, /ii = and U, /lo > 0, then (F, B?),,^ = (NEQ, Bf 

r^^y* «//oo,/oi,/ii > and /oo/ii > /oi/io, ^/^en {F,B\)^^p = (IMP,B^)^,p; 

(v) otherwise, {F,Bi)uj,p = {OR,B^)uj,p- 

The non-efficient version — with Bi,B2 replacing B^jBg, and replacing {■)uj,p — also 
holds. 

Proof. To prove (Fi,Bf)(j,p = (F2,Bf);^,p, it suffices to show that F2 e {Fi,Bl)ui,p and Fi e 
(F2, Bi)uj,p. We will verify this in each of the five cases. 

(i) Suppose /oo/ii = /oi/io- If /oo, /oi = 0, then 

F{xi,X2) = Ui{Xi)U2{x2) 

with C/i = (0, 1) and U2 = (/lo, /ii)- Similarly if /oo, /lo = 0, foi, fn = 0, or /lo, /n = 
0. In the remaining case /oo, /oi, /lo, /ii > 0. Then choose Ui = (l,/io//oo), U2 = 
(/oo, /oi). In aU cases F e (t/i, t/2), so (F, B^..^ = (B?),,p. 

(ii) if /oi, /lo = and /oo, /ii > 0, then F{xi, X2) = C/(xi)EQ(xi, X2), where C/ = (/oo, /ii), 
so F e ([/). Hence (F,Bf),,p = (B?),,p. 

(iii) If /oo,/ii = and /oi,/io > 0, then F{xi,X2) = C/(a;i)NEQ(a;i, 2:2), where U = 
(/oi,/io), so F G (NEQ,f/). Similarly NEQ(a;i,a;2) = f/'(a;i)F(xi, X2), where U' = 



(l//oi,l//io), so NEQ e (F,[/'). So (F,BO.,p = (NEQ,B? 



■p\ 

oj,p- 



(iv) If /oo,/oi,/ii > 0, /oo/ii > /oi/io, we can apply unary weights Ui,U2, where C/i = 
(l//oo,/oi//oo/ii), U2 = (l,/oo//oi), to implement IMPa(a;i,X2) = Ui{xi)U2{x2)F{xi, X2), 
where 

M(IMP,) . 



a 



where a = /01/10//00/11 < 1- Then we have IMF^, G {F,Ui,U2). Note that IMPo = 
IMP. If q; > 0, consider the function IMP^, with matrix 



M(IMP^) = 



1 1 
a'' 1 



Now IMP^ can be implemented as lMP^(a;i, a;2) = Ui{xi)U2{x2)F'^(xi, X2), by taking 
k copies of Ui, U2 and F. Since a < 1, we see that limfc^ooIMP^ = IMPo = IMP. 
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Moreover, the limit is efficient, since ||IMP - IMP^II < e if = 0(log£-^), and so an 
e- approximation to IMP can be computed in 0(log£:~^) time. Hence IMP G {F, B{)ui,p- 

Note that "powering" limits like that used here will be employed below without further 
discussion of their efficiency. 

Conversely, from IMP, we ffist implement IMPq,. If a = 0, we do nothing. Otherwise, 
we use unary weights Ui, U2 such that Ui = (1/a — 1, 1), U2 = (a, 1), to implement 
Fi{xi,X2) = Ui{xi)U2{x2)lMF{x2,xi), where 



M{Fi 



1-a 

a 1 



Then M(IMP„) = M(IMP)M(Fi), so IMP« G (IMP, t/i, f/2). Now we can recover 
F{xuX2) = f/3(a;i)f/4(a;2)IMP„(a;i,a;2), where Us = (/oo, /00/11//01), U4 = (l,/oi//oo), 
so we have F G (IMP, Ui, U2, U3, U4). Hence (F, i3f)^,p = (IMP, Bl)^,p. 

The remaining cases are (a) /oi,/io,/ii > 0, /oo/ii < /oi/io and (b) /oo, /oi, fio > 0, 
/ll = 0. 

First, we deal with part (a): If /oi,/io,/ii > and /oo/n < /oi/io, we apply 
unary weights Ui,U2, where Ui = (/11//01, 1), U2 = (I//10, 1//11), to implement 
0Rc,(a;i,X2) = f/i(xi)t/2(x2)F(xi, X2), where a = /00/11//01/10 < 1, and 



M(ORo 



a 1 
1 1 



If a = 0, ORo = OR, so we have OR G (F, B^). Otherwise limfc_^oo OR^ = ORq = OR, 
so we have OR G {F, B\)ui,p- 

Conversely, from OR, we ffist express NEQ. Use the unary function U = (2, 1/2) to 
implement Fi = [/(a;i)f/(x2)0R(xi, X2), where 



M(Fi 



1 ■ 

1 1/4 



Then lim^^oo F^ = NEQ, so NEQ G (OR, B{)^^p. Now we observe that M(IMP) 



IV 



M(NEQ)M(OR), so IMP G {OR, B^^^p. Now we have IMP„ G (OR, i3f as in 

above. Then M(OR„) = M(NEQ)M(IMP„), so 0R« G {OR, B^^)^,p. Now we ^ 
reverse the transformation from F to ORq, to recover F. 

Now, we consider part (b): If /00,/oij/io > and fu = 0, we apply unary weights 
Ui,U2, where Ui = (I//00, 1//10), U2 = (l,/oo//oi), to implement NAND(xi,X2) = 
Ui{xi)U2{x2)F{xi, X2) , where 



M(NAND) 



1 1 
1 



so we have NAND G {F,B'{). We now use unary weight U = (V2,2) to implement 
Fi(xi) = f/(xi)t/(xi)NAND(xi,X2) with 



M(Fi 



1/4 1' 
1 
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Then we have hmfc_s.oo -^i^ = NEQ, so again NEQ G {F, B{)ui,p- Then we observe that 
M(OR) = M(NEQ)M(NAND)M(NEQ), so OR G (F, i3f 

Conversely, from OR, we have NEQ G (OR, B\)u),p from the above. Then we have 
M(NAND) = M(NEQ)M(OR)M(NEQ), so NAND G (OR, i3^)<^,p. Now we reverse 
the transformation above from F to NAND to recover F. Thus F G (OR, B\)u},p- □ 



Remark 13. From Lemma [121 we see that IMP does not really occupy a special position in 
(IMP,i3^)^p, in the sense that there are other functions F with {F,Bi)^^p = {lMP,B{)^^p. 
Similarly, OR does not occupy a special position in (OR, B\)i^^p. Nevertheless, it is useful to 
label the classes this way, and we will do so. 



Remark 14. From the proof of Lemma 12, we have the following inclusions between the four 
classes involved. 



{B^).,p ^ (IMP,i3D.,p ^^^^'^1^-'^- 



In fact, (NEQ, Bi)i^^p and (IMP, Bi)uj,p are incomparable, and hence all the inclusions are 
actually strict. For one non-inclusion, note the clone (IMP, B\)u),p contains only Ism func- 
tions, and hence does not contain NEQ. For the other, we claim that any binary function in 
the clone (NEQ, i3]')(^^p has one of three forms, Ui{x)U2{y), U{x)EQ{x,y) or [/(a;)NEQ(x, y), 
and then observe that IMP matches none of these. The claim is a special case of a more 
general one, namely that any function in (NEQ, Bl)uj,p is of the form F^, where is a product 
of atomic formulas involving only unary functions and NEQ. To show this, we need only 
consider summing over a single variable. By induction, assume we have a pps-formula of the 
form -,^1 F^(a;, 2;)F^(a;, ?/), where F^ is a product of atomic formulas involving y (and 

certain other variables x), and F^ is a product not involving y. Then 

k 

J2 F^{x,z)F4x,y) = F^{x,z) ^ f/(y) J] NEQ(x„ 
ye{o,i} ye{o,i} i=i 

k 

= F^{x,z)U{xi)Y[^Q,ixi,Xi), 

i=l 

where U{xi) = U{1 — xi). The product of equalities can be removed by substituting Xj 
{j = 2, . . . , k) by Xi in ip{x, z), continuing the induction. 

Finally, it is straightforward to show that this class is closed under limits. That is, the 
limit of a sequence of functions which are products of unary and NEQ functions must itself 
be of this form. To see this, note that every fc-ary function in this class can be written as 
a product of 0(A;^) unary and NEQ functions. Then the conclusion follows by a standard 
compactness argument. The efficient version also follows. 



8 The class (OR, Bl)u;,p 

In the conservative case, we show that, somewhat surprisingly, the clone generated by the 
single binary function OR contains every function in B^. 
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Lemma 15. (OR,^^..,^ = B^. 

Proof. Suppose F G Bf^. Suppose variables. For each AC [n], let 1a be the 

assignment to Xi, . . . , in which Xj = 1 if i G A and Xj = otherwise. Then we will use the 
notation F{A) as shorthand for F{1a)- Let A= {A : F{A) > 0}, and let /x = min^g_4 
For any AC [n], let ua € B{ be the function such that ua{0) = 1 and ua{1) = 2F{A)/ fi—1 > 
1. Note that every function ua is in and we have IMP,NAND G (OR, i3^)a;,p, from the 



proof of Lemma 12 



Our goal will be to show that there is a finite subset Sp of {IMP, NAND} U i3f and a 
TM Mp^Sp with the following property: on input e > 0, Mp^Sp computes an arity-n pps- 
formula ip over Sp such that ||-F^ — -^||oo < ^- The running time of Mp^Sp should be at 
most a polynomial in loge~^. To define Sp, we will use two unary functions Ui and U2 
(both of which are actually constant functions). We define these by Ui{0) = Ui{l) = 1/2 and 
f/2(0) = f/2(l) = /i/2. Then Sp = {IMP, NAND, [/i, U2} U [Ja^aM- 

Let V = {vi, . . . , Vn}- For A G ^, introduce a new variable 2;^. Let V" = V U {za \ A G 
A}. Let 

V" XAeA / VieA / Vi^A / 

For every A G ^ the assignment x = 1a can be extended in two ways (both with za = 
and with = 1) to satisfy 

(17) ( J]lMP(z^,x,) j ( J]NAND(zA,a;0 ) =1- 

VieA / \i4A J 



Any other assignment x can be extended in only one way {za = 0) to satisfy (17). So if 
Ae A then 

F^M) = (2F(A)//x - 1) + 1 = 2F(A)//i. 
On the other hand, ii A ^ A then 

We have shown that F^^ G {Sp). Let us now define 



^2=e( nni^p('^-4,x,)) (nNAND(^ 

y" XA&AiaA / VirfA 



As before, for every A & A the assignment x = 1a can be extended in two ways {za = 



and Za = 1) to satisfy (17), and any other assignment x can be extended in only one way 
{za = 0) to satisfy it. So 

F^,{A) = 2 (AeA), F^,{A) = 1 (A^A). 

Thus G (Sp). Now define F3 by ^3(7!) = Ui{xi)F^^{A), so F3 G (5^), where 

FsiA) = l (AeA), Fs{A) = y2 (A^A). 
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Now limfc^,oo = Fq, where 

Fo{A) = l (AeA), Fo{A) = (A^A), 

and thus Fq G {Sf)i^^p. 

Note that Fq = Rp, the underlying relation of F. Now define F4 = F^^Fq, so that 

Fi{A) = 2F{A)/n {A e A), F^^A) = {A ^ A), 

Thus, by Lemma |4[ F4 e {Sf)uj,p- Now define F5 by F^iA) = L/'2(xi)F4(A), so F5 e {Sf)c,,p, 
where 

F,{A)=FiA) (AeA), F,{A) = (A^^A). 
Since F5 = F, the proof is complete. □ 

9 The main theorem 

In this section we prove our main structural result, which characterises, in the conservative 
case, the clones generated by a single pseudo-Boolean function. Since it is known that any 
clone generated by a finite set of functions can be generated by a single function |!7J, we have 
a characterisation of all finitely generated functional clones. 

Theorem 16. Suppose F & B^. 

• If F ^ (NEQ,i3?) then IMP G {F.BD^^p, and hence (IMP,i3?)^,p C {F,Bl)^^p 

• If, m addition, F ^ LSM then {F,Bl)^^p = Bp. 

The non-efficient version — with B,Bi replacing B^^B^, and (•)^ replacing {■)u,p — clIso 
holds. 

Proof. We start with the first part of the theorem. The aim is to show that either IMP G 
(F, Bi)u},p or F G (NEQ, B^). Let C be the relational clone {Rp, 5o, '^i)r- Since {Rf, (^o, ^i} ^ 
{Rf> I F' G {F} U B{}, C C {Rf^ I F' G {F} U -B^)r, so by Lemma § C C {i?^, | F' G 

{FM)}- 

First, suppose IMP G C. Then {F,Bl)uj,p contains a function F' such that Rf' = IMP. 



The function F' falls into parts (iv) or (v) of Lemma 12, so by this lemma, {F, B{)u],p is either 
(IMP, -Bf or (OR,-Bf)^,p. Either way, {F,Bl)^,p contains IMP (as noted in Remark |l4l). 
Similarly, if OR G C or NAND G C then IMP G (F, )^,p. 

We now consider the possibilities. If Rf is not affine, then Creignou, Khanna and Sudan 
Lemma 5.30] have shown that one of IMP, OR and NAND is in C. This is also stated and 
proved as jJOl Lemma 15]. In fact, the set of all relational clones (also called "co-clones") is 
well understood. They are listed in [T6| Table 2], which gives a plain basis for each relational 
clone. There is a similar table in |1] (though the bases given there are not plain). A Hasse 
diagram illustrating the inclusions between the relational clones is depicted in |3l Figure 2] . 
This diagram is reproduced here as Figure [T] A downwards edge from one clone to another 
indicates that the lower clone is a subset of the higher one. For example, since there is a 
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Figure 1: Post's lattice from |3l Fig. 2]. 



path (in this case, an edge) from IDi down to IR2 in Figure [T| we deduce that IR2 C IDi. 
We will not require bases for all relational clones, but we have reproduced the part of [T6| 
Table 2] that we use here as Table [l} 

U Rp is affine then the relations in C are given by linear equations, so C is either the 
relational clone IL2 (whose plain basis the set of all Boolean linear equations) or C is some 
subset of IL2, in which case it is below IL2 in Figure [TJ 

Now, EQ, 60 and 61 are in C. The relational clone containing these relations (and nothing 
else) is IR2, so C is a (not necessarily proper) superset of IR2. Thus, C is (not necessarily 
strictly) above IR2 in Figure [l] From the figure, it is clear that the only possibilities are that 
C is one of the relational clones IL2, IDi and IR2. 

Now IR2 C IDi and the plain basis of IDi is {EQ, NEQ, 6q, 61}. Therefore if C = IR2 or 
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Plain Basis 


IR2 

IDi 
IL2 


{EQ,5o,5i} 

{EQ,NEQ,5o,5i} 

{{(xi,...,Xfc)e{0,l}'= 


xi + . . . + Xk = c (mod 2)} 


keN,ce {0,1}} 



Table 1: The relevant portion of Table 2 of Some relational clones and their plain bases. 



C = ID I, then Rp is definable by a CNF formula over {EQ, NEQ, 60, Si}. 

Suppose that F{x) has arity n and, to avoid trivialities, that Rp is not the empty 
relation. Suppose that ip{vi, . . . , f„) is a CNF formula over {EQ, NEQ, 60, 61} implementing 
the relation R^ = Rp. 

Let V = {vi, . . . ,Vn}- Let ipi be the projection of ip onto variable fj. ipi is one of the 
three unary relations {(0)}, {(1)}, and {(0), (1)}. Let V = {vi e V \ ^pi = {(0), (1)}}. {V 
is the set of variables that are not pinned in Rp.) For Vi G V and vj G V, let tpij be the 
projection of ip onto variables Vi and Vj. ipij is a binary relation. Of the 16 possible binary 
relations, the only ones that can occur are EQ, NEQ and {0, 1}^. The empty relation is 
ruled out since i^i;' is not empty. The four single-tuple binary relations are ruled out since 
Vi and Vj are in V . For the same reason, the other four two-tuple binary relations are ruled 
out. The three-tuple binary relations are ruled out since ipij ^ IDi. 

We define an equivalence relation ~ on V in which Vi ~ vj iff ipij G {EQ, NEQ}. Let V" 
contain exactly one variable from each equivalence class in V. Let k = \V"\. For convenience, 
we will assume V" = {vi, . . . ,Vk}. (This can be achieved by renaming variables.) 

Now, for every assignment x : {vi, . . . ,Vk} — ?■ {0,1} there is exactly one assignment 
y : {ffc+i, . . . ,Vn} —7- {0, 1} such that Rp{x,y) = 1. Let cr^x) be this assignment y. Now, 
define the arity-fc function G by G{x) = F{x,a{x)). Note that 

(18) G{x)= J2 ^(^'2/)' 



where y is an assignment y : {vfc+i, . . . , f^} — )■ {0,1}. Clearly, from (18), G G {F,B{)^^p. 
Also, by construction, G{x) is a permissive function so we can apply Lemma [s] We finish 
with two cases. 

Case 1. Every 2-pinning of G is log-modular. Then G is log-modular, by Lemma [8j 
This means (see, for example, ^ Proposition 24]) that g = log2 G is an affine function of 
xi, . . . ,Xk and xi, . . . ,Xk so G G (NEQ, B^). For example, if g = qq + aiXi + 02X2 + 033:3 
then G can be written as 

G{xi,X2,X3) = ^Uo f/l(xi)f/2(x2)f/3(?/3)NEQ(x3,?/3), 

where Uq = 2"o G and Ui{x) = T^'-^ G B\. Since F{x,y) = Rp{x,y)G{x), we conclude 
that F G (NEQ,i3^). 

Case 2. There is a 2-pinning G' of G that is not log-modular. Since G is strictly positive. 



so is G'. Since G G (F, B^^^p, so is G' . By Lemma[T2| (parts (iv) or (v)), IMP G (G', B^^^p. 
By Lemma 11 IMP G (F, B()^^p. 
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Finally, we consider the case in which C = IL2. Let ©3 be the relation {(0, 0, 0), (0, 1, 1), 
(1, 0, 1), (1, 1, 0)} containing all triples whose Boolean sums are 0. From the plain basis of 
IL2 (Table we see that the relation ©3 is in C, so {F, E\) contains a function F' with 
Rp' = ©3- Let F" be the symmetrisation of F' implemented by 

F"{x, y, z) = F'{x, y, z)F'{x, z, y)F'{y, x, z)F'{y, z, x)F'{z, x, y)F'{z, y, x). 

Now let /io = F"(0,0,0) and /i2 = F"(0, 1, 1). Let U be the unary function with U{0) = 
fiQ^^^ and U{1) = fJ'l/^ ■ Note that since F G S^, the appropriate roots of /iq and 
/i2 are efficiently computable, so f/ G B^. Now Q)3{x,y,z) = U{x)U{y)U{z)F"{x,y, z), so 
©3 G {F, Bl). Finally, let U' be the unary function defined by U'{0) = 1 and U'{1) = 2 and let 
G{x,z) = i\®z{x,y,z)U'{y). Note that G(0,0) = 1) = 1 and G(0, 1) = G(1,0) = 2. 
By Lemma [l| G is in {F,B{). But by Lemma [l2[ IMP G {G,Bl)^^p so by Lemma g 

IMPG(F,i30a;,p. 

We now prove Part 2 of the theorem. Suppose that F is not Ism and that F ^ (NEQ, B{) 
so, by Part 1 of the theorem, we have IMP G (F, B\)t^^p. Let 

H{XI,X2) = 5^IMP(|/i,Xi)IMP(|/i,X2)IMP(xi,l/2)IMP(x2,l/2). 

Note that i/(0, 0) = H{1, 1) = 2 and i/(0, 1) = H{1, 0) = L Now for any integer fc, let 



X\ , . . . , Xji ) ^ F{yi,...,yn)Y[H( 

yi,---,yn i=i 



Xi, yi) ■ 



By construction, is strictly positive. Also, as k gets large, Hk{xi, . . . ,Xn) gets closer 
and closer to 2^^F{xi, . . . Thus, for sufficiently large k, H/^ is not Ism. By Lemma [l| 

H G {F,Bi)uj,p so Hk G {F,Bi)ui,p. Applying Lemma [s] to Hk, there is a binary function 
Fi G {F, Bi) that is not Ism so 

Fi(0,0)Fi(l,l) <Fi(0,l)Fi(l,0). 



By Parts (iii) and (v) of Lemma |T2| we either have NEQ G (F, B{) or OR G (F, -Bf). In the 
latter case, we are finished by Lemma 15 In the former case, we are also finished since (in the 
notation of the proof of Lemma pi) M(NEQ)M(IMP) = M(OR) so OR G (IMP, NEQ). □ 



10 Complexity-theoretic consequences 



In order to explore the consequences of Theorem 16 for the computational complexity of 
approximately evaluating #CSPs, we recall the following definitions of FPRASes and AP- 
reductions from [T8] . 

For our purposes, a counting problem is a function U from instances w (encoded as 
a word over some alphabet E) to a number n{w) G M-*^. For example, w might encode 
an instance / of a counting CSP problem #CSP(F), in which case n{w) would be the 
partition function Z{I) associated with /. A randomised approximation scheme for 77 is a 
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randomised algorithm that takes an instance w and returns an approximation Y to 11 (w). 
The approximation scheme has a parameter e > which specifies the error tolerance. Since 
the algorithm is randomised, the output F is a random variable depending on the "coin 
tosses" made by the algorithm. We require that, for every instance w and every e > 0, 

(19) Pr [e-'n{w) <Y < e'n{w)] > 3/4 . 

The randomised approximation scheme is said to be a fully polynomial randomised approxi- 
mation scheme, or FPRAS, if it runs in time bounded by a polynomial in \w\ (the length of 
the word w) and e~^. See Mitzenmacher and Upfal [301 Definition 10.2]. Note that the quan- 



tity 3/4 in Equation (19) could be changed to any value in the open interval (1/2, 1) without 
changing the set of problems that have randomised approximation schemes [261 Lemma 6.1]. 

Suppose that Ui and 772 are functions from E* to M-*^. An "approximation-preserving 
reduction" (AP-reduction) [18] from ili to gives a way to turn an FPRAS for 772 into an 
FPRAS for III. Specifically, an AP-reduction from IIi to 77^2 is a randomised algorithm A 
for computing IIi using an oracl^for 7^2- The algorithm A takes as input a pair {w,e) G 
E* X (0, 1), and satisfies the following three conditions: (i) every oracle call made by A is 
of the form {v,6), where v E S* is an instance of 77^2, and < 5 < 1 is an error bound 
satisfying 6^^ < polj{\w\,e~^); (ii) the algorithm A meets the specification for being a 
randomised approximation scheme for IIi (as described above) whenever the oracle meets 
the specification for being a randomised approximation scheme for 7^2! and (iii) the run- 
time of A is polynomial in \w\ and e~^. Note that the class of functions computable by 
an FPRAS is closed under AP-reducibility. Informally, AP-reducibility is the most liberal 
notion of reduction meeting this requirement. If an AP-reduction from Ui to 7^2 exists 
we write Ui <ap 77^2 • If -^1 <ap -^2 and 77^2 <ap -^1 then we say that 77^1 and 77^2 ore 
AP-interreducible, and write 77^1 =ap 77^2- 

A word of warning about terminology. Subsequent to [18] the notation <ap has been 
used to denote a different type of approximation-preserving reduction which applies to opti- 
misation problems. We will not study optimisation problems in this paper, so hopefully this 
will not cause confusion. 

The complexity of approximating Boolean ^^CSPs in the unweighted case, where the 
functions in F have codomain {0,1}, was studied earlier [20] by some of the authors of 
this paper. Two counting problems played a special role there, and in previous work in the 
complexity of approximate counting [TH]. They also play a key role here. 

Name #SAT 

Instance A Boolean formula (f in conjunctive normal form. 
Output The number of satisfying assignments of ip. 



■^The reader who is not familiar with oracle Turing machines can just think of this as an imaginary 
(unwritten) subroutine for computing 7T2. 
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Name #BIS. 

Instance A bipartite graph B. 

Output The number of independent sets in B. 

An FPRAS for #SAT would, in particular, have to decide with high probability between 
a formula having some satisfying assignments or having none. Thus #SAT cannot have 
an FPRAS unless NP = RPj^ The same is true of any problem to which #SAT is AP- 
reducible. As far as we are aware, the complexity of approximating #BIS does not relate 
to any of the standard complexity theoretic assumptions, such as NP 7^ RP. Nevertheless, 
there is increasing empirical evidence that no FPRAS for 7^ BIS exists, and we adopt this as a 
working hypothesis. Of course, this hypothesis implies that no ^^BlS-hard problem (problem 
to which #BIS is AP-reducible) admits an FPRAS. 

Finally, a precise statement of the computational task we are interested in. A (weighted) 
#CSP problem is parameterised by a finite subset J-" of and defined as follows. 

Name #CSP(J^) 

Instance A pps-formula ip consisting of a product of m atomic J-'-formulas over n free vari- 
ables X. (Thus, ip has no bound variables.) 

Output The value J2xe{o 1}" -^V'(^) where is the function defined by that formula. 

Officially, the input size \w\ is the length of the encoding of the instance. However, we 
shall take the size of a #CSP(J-') instance to be + m, where n is the number of (free) 
variables and m is the number of constraints (atomic formulas). This is acceptable, as 
we are only concerned to measure the input size within a polynomial factor; moreover, we 
have restricted J-" to be finite, thereby avoiding the issue of how to encode the constraint 
functions J-". We typically denote an instance of #CSP(J-') by / and the output by Z{I); by 
analogy with systems in statistical physics we refer to Z{I) as the partition function. 

Aside from simplifying the representation of problem instances, there is another, more 
important reason for decreeing that J-" is finite, namely, that it allows us to prove the following 
basic lemma relating functional clones and computational complexity. It is, of course, based 
on a similar result for classical decision CSPs. 

Lemma 17. Suppose that J-" is a finite subset of B^. If F E {J^)u),p then 

#CSP(F,^) <AP #CSP(^). 

Proof. Let k be the arity of F. Let be a TM which, on input e' > 0, computes a fc-ary 
pps-formula ip over U EQ such that \\F^ — F\\oo < e' ■ We can assume without loss of 
generality that no function in {F} U J-" is identically zero (otherwise every #CSP instance 
using this function has partition function 0). Let /imax be the maximum value in the range 
of F and let ^rmn be the minimum of 1 and the minimum non-zero value in the range of F. 

'*Tlic supposed FPRAS would provide a polynomial-time decision procedure for satisfiability with two- 
sided error; however, there is a standard trick for converting two-sided error to the one-sided error demanded 
by the definition of RP Pl Thm 10.5.9]. 
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Similarly, let S be the set of non-zero values in the range of functions in J-'U {EQ}. Let z^max 
be the maximum value in 5* and let z/min be the minimum of 1 and the minimum value in S. 

Consider an input {I,e) where / is an instance of #CSP(F, J-") and e is an accuracy 
parameter. Suppose that / has n variables, m F-constraints, and m' other constraints. We 
can assume without loss of generality that m > (otherwise, / is an instance of #CSP(J-')). 

The key idea of the proof is to construct an instance /' of 7^CSP(J-') by replacing each 
F-constraint in I with the set of constraints and extra (bound) variables in the formula ip 
that is output by A4 with input e'. We determine how small to make e' in terms of the 
following quantities. Let 

A^max max 

f'-min 

i? = 2"(/i^ax+ir~V™L 

c 

Let e' = I^tb- The time needed to construct ip for a given e' > is at most poly(log {e'^'^)), 
which is at most a polynomial in n, m, m' and as required by the definition of AP- 
reduction. We shall see that {I',e/2) is the sought-for instance/tolerance pair required by 
our reduction. 

Let be the instance formed from / by replacing every F-constraint with an F^- 
constraint. Note that Z{I^) = Z{I'), since /', an instance of #CSP(J-'), is an implementation 
of I^. We want to show that if an oracle produces a sufficiently accurate approximation to 
Z{I') (and hence to Z{I^)) then we can deduce a sufficiently accurate approximation to Z{I). 
Observe that the definition of FPRAS allows no margin of error when Z{I) = 0, and our 
reduction must give the correct result, namely 0, in this case. Therefore we need to treat 
separately the cases Z{I) = and Z{I) > 0. We will show that 

(20) Z{I) = implies Z{L^) < C/3, 
and 

(21) Z{I) > implies Z{I^) > 2C/3; 
moreover, in the latter case, 

(22) e''/^Z{I) < Z{I^) < e''^Z{I). 

These estimates are enough to ensure correctness of the reduction. For a call to an oracle 
for 7^CSP(J-') with instance /' and accuracy parameter e/2 would return a result in the 
range [e~^/^Z(/^), e^/^Z(J^)] with high probability. Observe that this estimate is sufficient 



to distinguish between cases (20) and (21). In the former case, we are able to return the 



exact result, namely 0. In the latter case, we return the result given by the oracle, which 



by (22) satisfies the conditions for an FPRAS. 



To establish (20-22), let Y' be the set of assignments to the variables of instance / 
which make a non-zero contribution to Z{I) and let Y" be the remaining assignments to the 
variables of instance /. Let Z'{I^) be the contribution to Z{I^) due to assignments in Y' and 
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Z"{I^) be the contribution to Z{I^) due to assignments in Y" (so Z{I^) = Z'{I^) + Z"[I^)). 
We can similarly write Z{I) = Z'{I) + Z"{I), though of course Z"{I) = 0. 
First, note that if \F.^{x) - F{x)\ < e' and F{x) > then 

\F4x)/F{x) - 1| < e'/F{x) < e' / 
Fjx) 



so 



-2e7/^mm 



< 



F{x) 



< e 



We conclude that 



so 



Furthermore, 



Here we use I IF, 



e-2^''"/'^™Z'(J) < Z'{I^) < 62^'"^//^™ (J), 



\Z'{I)-Z'm)\<—Z{I)<e'A. 



\Z"{I)-Z"m)\=Z"m)<e'B. 
<£'<!; the " + 1" in the definition of B absorbs the discrepancy 



between F^ and F. Combining these two inequalities yields 



(23) 



\Z{I)- Z{I^)\<e'{A + B) < 



eC 



Now, Z(/) > implies Z{I) > C, and hence (|20|) and (|2T) follow directly from (g. If 
Z{I) > we further have 



Z{I) 



- 1 



Z{I) 



c 



This establishes (22) and completes the verification of the reduction. 



□ 



Theorem 18. Suppose is a finite subset of B^. 

• IfTC (NEQ,-Bn then, for any fimte subset S ofB^, there is an FPRAS for #CSP{J^, S). 

• Otherwise, 

o There is a finite subset S of B\ such that #BIS <ap #CSP(J-', S). 

o // there is a function F ^ T such that F ^ LSM then there is a finite subset S of 
Bl such that #SAT =ap #CSP(J^, S). 

Proof. First, suppose that C (NEQ,i3^). Let S* be a finite subset of B^. Given an m- 
constraint input / of ^CSP(J-', 5) and an accuracy parameter £, we first approximate each 
arity-/c function F E J-' used in I with a function F : {0, 1}^ — >■ Q-° such that Rp = Rp, 
and for every x for which F[x) > 0, 



(24) 



-e/m < -^(^) / e/m 

-F{x)-' ■ 
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Let J-" = {F I F G J-"} and let / be the instance of #CSP(J-', S) formed from / by replacing 
each F-constraint with F. [191 Theorem 4] gives a polynomial-time algorithm for computing 
the partition function Z{I), which satisfies 



(25) 



e-'Z{I) < Z{I) < e'Z{I). 



Second, suppose that J" ^ (NEQ,i35'). By Theorem 16, IMP e By Observa- 



tion pU there is a finite subset S of Bl such that IMP G S)^,p. Thus, #CSP(IMP) <ap 



#CSP(J^,S), by Lemma[T7l However, #BIS =ap #CSP(IMP) by ^ Theorem 3]. 

Finally, suppose that there is a function F E such that F ^ LSM. By Theorem 16 
= B^ so OR G {F, B\)i^^p. By Observation |3| there is a finite subset S of B\ 
such that OR G (F, so by Lemma [TtI #CSP(0R) <ap #CSP(F,S). However, by 

[201 Lemma 7] #SAT <ap #CSP(0R). To see that #CSP(J^,S) <ap #SAT, let I be an 



m-constraint instance of 7^CSP(J-', S). For each function G G J-'U S*, define G as in (|24j). Let 
/ be the instance of ^CSPdG | G G -FU S}) formed from / by replacing each G-constraint 
with a G-constraint. Equation (25) holds, as above. Furthermore, from [T^ Section 1.3] the 
problem of evaluating #CSP({G | G G FUS}) (even with G as part of the input) is in #Pq, 
the complexity class comprising functions which are a function in #P divided by a function 
in FP, so can be AP-reduced to ^^SAT. □ 



Example 19. Let F G i3f be the function defined by F(0, 0) = F(l, 1) = A and F(0, 1) = 
F(1,0) = 1, where A > 1. Then, from Theorem 18, #CSP(F, S*) is #BIS-hard, for some set 



S of unary weights. (In fact, this counting CSP is also #BIS-easy.) Note that #CSP(F, S) 
is nothing other than the ferromagnetic Ising model with an applied field. So we recover, 
with no effort, the main result of Goldberg and Jerrum's investigation of this model 



Example 20. If F is as before, but A G (0,1), then F ^ LSM and Theorem 18 tells us 



that #CSP(F, S*) is #SAT-hard, for some set 5* of unary weights. This is a restatement of 
the known fact that the partition function of the antiferromagnetic Ising model is hard to 
approximate ^23]. Actually, this is true without weights, but that is not directly implied by 
Theorem I 



11 The classes (LSMa;) 



In this section, we will be concerned with expressibility, and less so with efficient computabil- 
ity. Thus, we will use function classes without attempting to distinguish the efficiently 
computable functions in the class from the remainder. 



Define LSM^ = LSM n Bk- It follows from the proof of Lemma 12 that (LSM2) 



(IMP, Bl). Since LSM is central to Theorem 16 we need to consider whether LSM = (LSM2) 



and, if not, what internal structure it may possess. To this end, we consider here the 
functional clones (LSM^);^ for k > 2. As mentioned in the abstract, we will make use of two 
transforms: the Mobius transform to show (LSMs)^ = (LSM2)a; (in fact we prove the stronger 
statement (LSM3) = (LSM2)), and the Fourier transform to show that (LSM4)t^ 7^ (LSM2)cj. 
See [21 [361 ESj for corresponding results in the context of optimisation. 
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For all x,y E {0, 1}", we will write x < y to mean that for all 1 < z < n we have Xi < Ui 

For all / G define the Mohius transform f G by 

(26) f{y) = 5^(-l)l^-'^l/(it;) {y G {0,1}"), 

w<y 

and note that the Mobius transform is invertible: 

(27) f{x) = Y,fiy) {xe{0,ir). 

y<x 



See also [2^ for further information. We will not require anything other than (26) and 



(27). We next show that certain simple functions are in (LSM2). 



Lemma 21. Let y G {0, 1}". Let t G M, with t > Q if \y\ > 1. Let F G Bn be the unique 
function satisfying log F{y) = t and log-F(x) = for x ^ y. (Explicitly, F{x) = e* for 
y < X and F{x) = 1 otherwise.) Then F G (LSM2). 

Proof If t > define U G LSMi by f/(0) = 1 and U{1) = e* - 1. We will argue that for all 
X we have 

F{x) = J2uiz) n IMP(2,xO. 

z i-yi=l 

Indeed ii y < x we get F{x) = U{0) + U{1) = e\ and otherwise F{x) = U{0) = 1. 

Now we consider the case < 1. li y = let i = 1, and otherwise let i he the unique 
index with yi = 1. Then F{x) = U{xi) where f/(0) = F(0) and U{1) = F(l). Hence 
F G (LSMi) C (LSM2). □ 

If a; G {0, 1}", let x = l-x and, if F G Bn, let F{x) = F{x). We will use the following 
simple fact. 

Lemma 22. F G LSM^ iff F e LSM^. 

Proof. By symmetry, it suffices to show that, for any x,y G {0,1}'^, 

F{x)F{y) = F{x)F{y) < F{x Ay)F{xV y) 

= F{xVy)F{x7^) = F{x W y)F{x A y). □ 

Now we show that it is only necessary to consider permissive functions. The construction 
used in the lemma is adapted from one given, in a more general setting, by Topkis f31]. 

Lemma 23. For every F G LSM^ there exists G G LSM^*^ such that F = RpG. Furthermore 
Rf G (LSM2), so (LSMfc) = (LSM>°,LSM2). 

Proof. First, assume F{Q) 7^ 0. Let /i = mina;{F(a;) | F{x) 7^ 0}/ ma.y.xF{x). Set 

G{x) = max{F(?/)/il^l-l3'l \ y<x}. 
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Then G is strictly positive and G{x) = F(x) wherever F{x) ^ 0. It remains to show that 
G G LSM. For all x, x' there exist y < x and y' < x' such that 

G{x)G{x') = F(?/)F(t/>l"l-l^l+l"'l-l^'l 

< F{y A y')F{y V y^l^'l-l^/l+l-'l-li/'l 

= F{y A y')F{y V j;>l-A-'l-l2/Ay'|+|xv.'|-li/vy'| 

< G{x Ax')G{xy x'). 

Now we deal with the case F(0) = 0. Let F'{x) = F{x) for all a; 7^ and let F'(0) = 1. 
Then F' G LSM and F'{Qi) 7^ 0, and we have shown that there exists G G LSM^° such that 
F' = Rf'G. But then F = RpG. 

By [20l Corollary 18], Rp is a conjunction of implications and constants, and hence 
Rf G {60, 61, IMF) C (LSM2). Thus (LSMfe) C (LSM>°, LSM2). The reverse inclusion is 
trivial. □ 

Lemma 24. (LSM3) = (LSM2). 



Proof. From Lemma 23, we need only prove LSM>° C (LSM2). Thus let F G LSMf° 



_ '3 ; 

/ = logF, and first assume /(1, 1, 1) > 0. We will show that F G (LSM2). Note that, by 
log-supermodularity of F, 

7(1, 1, 0) = /(O, 0, 0) - /(I, 0, 0) - /(0, 1, 0) + /(1, 1, 0) > 0. 

and similarly /(I, 0, 1), /(0, 1, 1) > 0. Hence f{y) > for all \y\ > 1. For all y G {0, 1}^ 
let Fy be the unique function satisfying log Fy{y) = f{y) and log Fy{z) = ioi z y. 
Then logF = ^^logFy, which implies logF = J2y^'^&^yy which implies F = Yly^y By 
we have Fy G (LSM2) for all y, and therefore F G (LSM2). If 7(1,1,1) < 0, 



Lemma 



21 



let H = F. Note that H G LSM3 and logif(l, 1, 1) > 0, so by the previous paragraph, 
H G (LSM2). By Lemma|22|this implies F G (LSM2). □ 

In view of Lemma 24, it might be conjectured that LSM = (LSM2)aj- In fact, this is not 
the case, as we will now show. First we consider the class V of functions F & B for which 
the Fourier transform F has nonnegative coefficients, where 

(28) F{y) = ^ E (?/e{o,ir). 

ioe{o,i}" 



Thus F G P if and only if F G See [I7j for further information. We will use (28) and the 
convolution theorem: for all F,G E Bn we have 

(29) FG{x)= J2 Fiy)G{x®y) (a^, ?/ G {0, 1}"). 

yG{0,l}" 

where © denotes componentwise addition modulo 2. See for example [171 Section 2.3] for a 
proof of the dual statement. 

To show that V is closed under pps-formula evaluation, it is useful to restrict to atomic 
formulas where variables are not repeated within a scope. 
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Lemma 25. Let (1 B. For all pps-formulas ip over T there is another pps-formula tp' 
over J^U {EQ} such that = F^i and no atomic formula of ip' contains a repeated variable. 

Proof. Given obtain ip' as follows. For each variable Vi that is used di>2 times in total in 
ijj, replace the uses of Vi by new distinct variables v}, - ■ ■ , multiply by atomic formulas 
EQ(wj,f^) for 1 < j < (ij, then sum over these new variables vl- □ 

Lemma 26. V is closed under addition, summation, products and limits. Moreover, V is a 
pps^- definable functional clone. 

Proof, li F,G G V, then F + G = F + G is clearly non-negative, and FG is nonnegative 
by the convolution theorem (p9|). For summation, as in Lemma [61 we consider summing 



over the last variable. So, let F[{x) = '^tF{x,t). Then it follows easily from (28) that 
H{y) = 2F{y, 0) > for all y. For limits note that if F„ — )■ F then F„ — )■ F, and a hmit of 
non-negative functions is non-negative. 

Let ip he a pps-formula over V U {EQ}. We will argue that that F^ G V. By Lemma 



25 there is a pps-formula ijj' over V U {EQ} such that F^ = F^t and such that no atomic 
formula of ip contains a repeated variable. The functions F^ defined by atomic formulas 
ip = G{vi^, ■ ■ ■ ,Vi^) oi ip' are therefore "expansions": permutations of the function G' G Bn, 
n > k, defined by 

(30) G'{x, x') = G{x) [x G {0, 1}^ and x' G {0, 1}""^). 

It therefore suffices to check that V is closed under expansions. Let G' be the expansion 



defined by (g[. Then, for all y G {0, l}'^ and y' G {0, l}"-^ we have G'{y,y') = G{y) if 
y' = 0'', and G'{y, y') = otherwise, and hence G' G V. Note that EQ = ^EQ, so EQ G V. 
Thus P is a pps-definable functional clone, but it is also closed under limits. □ 

For F e B, let F* denote FF. Now let C be the class of functions F e B such that G* e V 
for every pinning G{x) = F{x; c). Note, in particular, that HUE Bi, U*{z) = f/(0)f/(l), a 
nonnegative constant. Therefore we have -Bi C C and, to establish that F G C, we need only 
check pinnings of F of arity at least 2. 

Lemma 27. C is a pps^- definable functional clone. 



Proof. As in Lemma 26 we will check that C is closed under "expansions", products, sum- 
mations, and limits. But a pinning of an expansion (or product, summation, or limit) of 
functions in C is an expansion (or product, summation, or limit) of pinnings of functions in 
C, which are necessarily in C because C is closed under pinnings. So it suffices to check the 
C condition for trivial pinnings, for example to check closure under products it suffices to 
show that F,G eC imphes (FG)* G V. 



Let G G C have arity k, let n > k, let G' be the function defined by (30). Note that G'G' 



is an expansion of GG, so G'G' G V and G' G C. We have EQ G C, since EQEQ = EQ G P. 



Closure under product follows from Lemma 26 and the observation that {FGy = F*G 



For summation, again as in Lemma |6| we consider summing over the last variable. Then, if 
H[x) = Y2t -^(^5 where F has arity A; + 1, then 

H^i^) = Et t) Et F{x, t) = (Forix) + iF^nx) + Zt F^i^, t). 
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where Fq and Fi are the pinnings Fi{x) = F{x] i). We have (-fo)*? (-^i)* & V hy the pinning 



assumption, and the arity k function F*{x, t) is in V by Lemma 26 Thus H* is the sum 
of three functions in V, and so, using Lemma [26] again , H* G V. Finally note that C is closed 
under limits: if — )■ F as n — oo then F* ^ F*, but V is closed under limits. □ 

Lemma 28. (LSMa)^ C C. 

Proof. Let F E LSMs- Note that F*{0, 0) = (F(0, 0)F(1, 1)+F(0, 1)F(1, 0))/2, and F^(0, 1) = 
0) = 0, and 1) = (F(0, 0)F(1, 1) - F(0, 0))/2 > 0. So F* G P, and hence 
F G C Thus LSM2 C C and, since C is a pps^-definable functional clone, (LSM2)a; ^ C □ 

Lemma 29. (LSMg)^ C (LSM4)^. 

Proof. Since LSM2 C C by Lemma [28} we need only exhibit a function F G LSM4 which is 
not in C. Define F : {0, 1}^ ^ M>° by 

if xi + X2 + X3 + X4 = 4; 
F(a;i,X2,X3,X4) = 2, if xi + X2 + + X4 = 3; 

otherwise. 

To show F G LSM4, by the symmetry of F and Lemma [8} it suffices to show that the three 
2-pinnings F(0, 0, X3, X4), F(0, 1, 0:3, X4) and F(l, 1, X3, 0:4) are Ism. This is equivalent to the 
inequalities lxl>lxl,2xl>lxl, and 4 x 1 > 2 x 2 respectively, which clearly hold. 



To show that F ^ C, we need only use (28) to calculate 



, 4x1-4x2+6x1-4x2+4x1 1 
1,1,1) = = "8 ^ °- 

Indeed F G (LSM4)a; but F ^C. By Lemma |28| (LSM2),. C (LSM4)c.nC C (LSM4)<^. □ 
Unfortunately, this approach does not seem to extend to showing (LSM4)^ C LSM or even 



(LSM4) C LSM. Neither can we extend the result of Lemma 24 to show that (LSM4)^ 
(LSMs)^^. However, we will venture the following, which is true for k = 1. 

Conjecture 30. For all k > 1, (LSM2fc)c. = (LSM2fc+i)^ C (LSM2fc+2)^. 



A consequence of a proof of Conjecture 30 would be that LSM 7^ for any finite set 
of functions J-". 



12 Restricted unary weights 

In this section and the next, we depart from the conservative case, and consider allowing 
only restricted classes of unary weights. 

We have already noted that restricting to nonnegative (as opposed to arbitrary real) unary 
weights produces a richer lattice of functional clones, and an apparently richer complexity 
landscape. Thus, by further restricting the unary functions available, we might expect to 
further refine the lattice of functional clones. 

In this section, we will consider the unary functions that favour 1 over 0, or vice versa. 
With a view to studying this setting, let Bf"""^ (respectively B'^^) be the class of unary 
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functions U from Bi such that f/(l) < f/(0) (respectively, f/(0) < U{1)). By Bf"'"'''^ and 
Bi^ we denote the efficient versions of these sets. Also denote by EQ' the permissive equality 
function defined by EQ'(x, y) = 2 if x = y, and EQ'(a;, y) = 1 otherwise. 

The first hint that partitioning Bi into B^^ and niay yield new phenomena comes 

from the observation that for any finite subset S of Q'^^^^^p^ the problem #CSP(EQ',5') 
reduces to the ferromagnetic Ising model with a consistent external field, for which Jerrum 
and Sinclair have given an FPRAS [25]. (A consistent field is one that favours one of the 
two spins consistently over every site.) The point here is that #CSP(EQ',S') is tractable. 



even though EQ' ^ (NEQ, Bi)^^. In contrast, by Theorem 18 or from the arguments in 
there is a finite subset S of B^ such that #CSP(EQ',S') — the ferromagnetic Ising model 
with local fields, with different spins favoured at different sites — is #BIS-hard. Of course 
similar remarks apply to Bi^'^. 

In terms of functional clones, the clone (EQ', Bf'^^'^)^^ is not amongst those we met in ^ 
it is incomparable with (NEQ, Bi)^, and strictly contained in (IMP,i3i)^. Specifically, we 
have 

Lemma 31. 

(i) NEQ ^ (EQ',i3^°-'^),. 
(zz) EQ' ^ (NEQ, Si),. 

(EQ',Sf-'^), C (EQ',i3i), = (IMP,Si),, 
and (EQ',Sf™).,p C (EQ',Sf),,, = (IMP,^^,,,. 
Proof. Recall the class V of functions with non-negative Fourier coefficients defined in ^pT| 



Note that P is a pps^^-definable functional clone by Lemma 26 Note that EQ'(0,0) = 6/4 
and EQ'(1, 1) = 2/4 and EQ'(0, 1) = EQ'(1,0) = 0, so EQ' E V. Also, for any U E Bi 
we always have f/(0) > 0, but U{1) = (t/(0) - U{l))/2 > if and only if U E -Bf™'^. So 
(EQ',i3f-°), CP. 

For (i) note that NEQ(1, 1) = -2/4 < so NEQ ^ V. 

For (ii). Remark [l4| showed that all functions in (NEQ,i3i)tj are products of atomic 
formulas. Therefore, if EQ' E (NEQ, Bi)^, it must have one of the three forms Ui{x)U2{y), 
Ui{x)EQ{x,y) or ?7i(x)NEQ(x, y), where Ui, U2 E Bi. Now note that EQ'{x,y) is not of any 
of these. 

For (iii), the inclusion {EQ' , Bf"""^) uj C (EQ',i3i)t^ is trivial. It is strict since, as we 



showed above, (EQ', -Bf ™°), n Bi = i3f The equality follows from Lemma 12 [iv). □ 



It is interesting to note that the strict inclusion between (EQ', Bf°'"^)^,p and (EQ', Bi)^^p 
is provable, even though the gap in the computational complexity of the related counting 
problems is only suspected. The other side of the coin is that two functional clones may differ, 
without there being a corresponding gap in complexity between the two counting CSPs. The 
main result of the section exhibits this phenomenon in a natural context: the two functional 
clones are incomparable, but there is an approximation-preserving reduction from one of the 
corresponding counting CSPs to the other. This is interesting, as it demonstrates that it is 
sometimes necessary, when constructing approximation-preserving reductions, to go beyond 
the gadgetry implied by the clone construction (even with the liberal notion employed here, 
including limits). 
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Recall that ©3 is the relation {(0, 0, 0), (0, 1, 1), (1, 0, 1), (1, 1, 0)}. 
Lemma 32. ©3 i (IMP, ""'^^^p and IMP i (©3, Sf""'^-,^ 

Prooj. First we show that ©3 i (IMP, iSf""'^.;,^. By Lemma (IMP, iSf ""'^)^,p C LSM. 
However, ©3 ^ LSM, since for x = (1, 1, 0) and y = (0, 1, 1), 

= ©3(0; V y) Ay)< ©3(x) ©3(2/) = 1. 
Now we show that IMP ^ (©3, -6^°^"^'^)^,^. Recall the class V of functions with non- 



negative Fourier coefficients defined in ^ 11 Note that P is a pps^-definable functional clone 
by Lemma |26| For all U G Sf™"" we have f7(0) = ([/(O) + U{l))/2 > and f7(l) = 
{U{0) - U{l))/2 > 0, so U e V. Also, ®s = ^EQ^ where EQ3 is the arity 3 equality 

relation {(0, 0, 0), (1, 1, 1)}. So (©3, -Sf ""'^.^.p ^ ^- But IMP(0, 1) = (1 - 1 - l)/4 < 0, so 
IMP ^V. ' □ 



We know now that IMP is not pps^^-definable in terms of ©3 and in contrast, we 

see in the next result that IMP is nevertheless efficiently reducible to ©3 and 

Lemma 33. There is a finite subset S of B'l"^^'^ such that 

#CSP(IMP) <AP #CSP(©3,^) 

Proof. First, we need some definitions. Suppose that M is a matrix over GF(2) with rows V 
and columns E with \V\ = n. For a column e and a "configuration" a : V ^ {0,1}, define 
6e{<j) to be Mj^e(T(i), where the addition is over GF(2). (5e(cr) is the parity of the 

number of Is in column e of M that are assigned to 1 by a. Given a parameter y > 0, the 
Ising partition function of the binary matroid A4 represented by M is given by 

a:V^{0,l} e£E 

Now, from [201 Theorem 3], #CSP(IMP) =ap #BIS. Also, for every efficiently approx- 
imable real number y > 1, from [231 Theorem 1] there is an AP-reduction from #615 to the 
problem of computing ^ismg(-^; v), for given M. 

The set of subsets ACE such that the submatrix corresponding to A has an even 
number of Is in every row is called the cycle space of Ai and is denoted C{M). A standard 
calculation expresses Zising{M.', y) in terms of C{M.). Let w = {y — l)/{y + 1). 
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a:V-^{0,l} ee-B 



ie<5e(o-) 




cr ACEeeA 



ACE a- e£A 

AeCiM) 



2» 



w 



\A\ 



AeC{M) 

Here is the justification of tfie penultimate line (wliy only A G C{Ai) contribute to tlie 
sum, and wliy tlie factor 2"): Suppose, for a set A C E, that some row i has has an odd 
number of I's in columns in A. Then for any configuration a' : V \ {i} {0, 1}, one of the 
contributions extending a' to domain V contributes —1 and the other contributes +1. On 
the other hand, if i has an even number of I's in A, then the two contributions are the same, 
so we just get a factor of 2 times the contribution from the smaller problem, without this 
row. 

Note that, since y > 1, we have < w < 1. Now the point is that it is easy to express the 
sum J2a€C{m) solution to an instance of #CSP(©3, Uw), where Uw is the unary 

function defined by Uy^{0) = 1 and Uw{l) = w. A vector x represents the choice of A ^ E 
— the j'th column is in A iS Xj = 1. Then the constraint that the submatrix corresponding 
to A has an even number of Is in some row, say row i, is given by the linear equation 



■.Mi ,=1 — 0- ■'■f ^^is linear equation has just two terms then it is an equality, and it can 
be represented in the CSP instance by substituting one variable for the other. Otherwise, it 
can be expressed using conjunctions of atomic formulas ©3. Thus, we have an AP-reduction 
from #CSP(IMP) to #CSP(©3, U^). □ 



Remark 34. Lemma [32] and 33 show that, as far as counting CSPs are concerned, the ex- 
pressibility provided by efficient pps^^-definable functional clones is more limited than AP- 
reductions. Here we show that the problem 7^CSP(IMP) is AP-reducible to a t^CSP problem 
whose constraint language consists of functional constraints from ©3 U ^^"""^'^^ we aren't 
able to express IMP using ©3Ui3^°'*'^'^. On the other hand, if the definition of efficient pps^^- 
definable functional clones were somehow extended to remedy this deficiency, then Lemma 17 



would probably have to be weakened. While we do know (from Lemma 33) that there is 



a finite subset S of ^^"'"^'P for which 7^CSP(IMP) <ap #CSP(©3,S'), the corresponding 
stronger statement from Lemma 17, #CSP(IMP, ©3, S) <ap #CSP(©3, S), is unlikely to be 
true since #CSP(IMP, ©3, S) =ap #SAT ^ Theorem 3]. 
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Remark 35. Lemma 32, as with the earher Lemma 31_ 
structure of efficient functional clones {J^)cj,p with q'^°^^'P 



shows that there may be a r ich 
C J^. By contrast, Theorem 



16 



and Lemma [7| guarantee that if C J^, then, the only possibilities are 

• (^kpC(NEQ,i30a.,p, or 

. (IMP,i3f)^,pC (^),,pCLSM, 

• or = BP. 



13 Using fewer weights 

We saw that a finite set of unary weights suffices to generate the functional clones that we 
encountered in previous sections. Here we observe that just one or two weights often suffice. 
In the following proposition, | denotes the constant nuUary function that takes value |. 

Lemma 36. 

(z) B, C (IMP, \)^ (B\ C (IMP, 
(ii) Br C (OR, (BT^ C (OR, i)^ J. 
(in) Bf--- C (NAND, i)^ f^f ""'^ C (NAND, D^J. 

Proof. Note that (iii) is the same as (ii) with the roles of and 1 reversed, so we will just 
prove (i) and (ii). We start with a general construction that works for both parts (i) and (ii) 
of the proposition. Let F be a binary function and /, J instances of #CSP(F). We assume 
the sets of variables of / and J are disjoint. The disjoint sum Jl±l J of / and J is the instance 
whose set of variables is the union of those of / and J, and F{x, y) is in / l±l J if and only 
if F{x, y) occurs in / or J . The ordinal sum / +< J of I and J is their disjoint sum along 
with every atomic formula F{x, y) such that a; is a variable of / and ?/ is a variable of J. 

Claim. 

(i) For any F G B2, Z{I W J) = Z{I) ■ Z{J). 

(ii) If F G {IMP, OR} then Z{I +< J) = Z{I) + Z{J) - 1. 

The ffist part is trivial. To show the second part consider an assignment (a:;, y) such that 
x,y map the variables of J, J, respectively, to {0,1}, and Fi+^J{x,y) 7^ 0. If F = IMP 
and any of the components of x equals 1, then y = 1. However, if a; = (or ^/ = 1) then y 
(resp. x) can be any legitimate assignment of J (resp. /). If F = OR then one of the x,y 
must be 1, while the remaining one can be any assignment with Fj{x) 7^ or Fj{y) 7^ 0. 
This completes the proof of the claim. 

Denote the instance consisting of a single variable without constraints by 2, the disjoint 
sum of k instances 2 by 2*^, and the ordinal sum I +< . . . +< I of k copies of instance 
I hj k ■ L (Note that the operator +< is associative, so this makes sense.) Let also | 

denote the instance consisting of a single nullary ^ function, and let denote the sum 
of k copies of |. Note that Z{2) = 2 and ^(|) = ^, justifying the notation. Note that 
for every natural number a and every positive integer i, Z[a ■ 2^) = a2^ — a + 1. (This 
can be proved by induction on a with base case a = 0, using Part (ii) of the claim for the 
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inductive step.) Furthermore, for every positive integer k, if oi, . . . , are natural numbers 

then Z{ai ■ 2^ +< h< ■ 2*^) = (ai2^ H h 0^2*^) - (ai H h a^) + 1. (This can be 

proved by induction on k using base case k = 1 using the previous observation.) 



Suppose G E Bi and let G'^"-' be a rational valued approximation to G such that G'*-"-'(a) 7^ 
and \G^'^\a) — G{a)\ < 2~". Assume that this rational approximation is given as a finite 

binary expansion, so that G^''\0) = 3^(00 + ai2^ H h 0^2^=) 7^ and G('*)(l) = 3^(60 + 

bi2^ H h 6^2^) 0. Let 7, J be instances of #CSP(F) given by 



From the observations above, Z{{ao H h Ofe - 1) • 2) = ao H h Ofc so Z{I) = 2'"G'(")(0). 

Similarly Z{J) = 2"*G(")(1). Let K), be the variables of I, J, respectively, and let Go 
(respectively, Ci) be the set of pairs {x,y) such that F{x,y) is an atomic formula of I 
(respectively, J). 

First suppose F — IMP. Consider the formula 



where c is a new variable and the sum is over all variables in Vq U Vi. An assignment 
cc : Vo U Vi U {c} {0, 1} can only contribute to F^^ if either: x{c) = and x{b) = for 
all b e Vi, or x{c) = 1 and x{a) = 1 for all a e Vq. Hence F^„(0) - 2-"*Z(7) = G(")(0) and 
F^„(l)=2— Z(J) = GW(1). 

Now suppose F = OR, and let G G B^^ and G^"^^ be as before, but with the restriction 
G{1) > G(0). This time, let G(")(0) = ^{ao + ai2^ + ■ ■ ■ + ak2^) ^ and ^("^(l) - ^("^(O) = 
2^(60 + bi2^ + . . . + be2^) ^ 0. Here we are using the fact that G(")(l) - G(")(0) > 0, which 
follows from G{1) > G{0) for sufficiently large n. Let instances 7, J be defined for the values 
oo + ai2^ + • • • + afc2^, 1 + bo + bi2^ + • • • + 6^2^ in the similar way to before (but note the 
extra 1); and let Vq, Vi, Go, Gi again denote the set of variables and constraints of I, J. As 
before Z{I) = 2™G(")(0) and Z{J) = 2™(G'(")(1) - G^''\0)) + 1. Let TsT = / +< J and let G 
be the set of scopes of the constraints in K. Consider the formula 



where c is a new free variable and the sum is over all variables in VqU Vi. An assignment 
X : VqUViU {c} — > {0, 1} can only contribute to F^^ if either: x{c) — and x{b) — 1 for all 
b e Vo, or x{c) = 1. Therefore F^„(0) = 2-"'Z{I) = ^("^(O) and F^„(l) = 2-'"Z{I +< J) = 
2-™(Z(/) + Z{J) - 1) = G(")(l)- 

To obtain the efficient version of the proposition let Mq and Mi be TMs that, given n, 
compute the first n bits of G{0) and G(l) respectively, in time polynomial in n. Then a TM 
M' that constructs G^ G {F, |) such that \\Gs — G||oo < £ works as follows. First, it finds 
the smallest n such that n > log£~^. Then it runs Mo and Mi on input n to find G(")(0) 
and G*^"^(l). Finally, M' outputs the formula ^n- The running time of M' is the sum of: the 
time to run Mq, the time to run Mi, and time 0(log^£~^) to construct ijjn- □ 



7 = ai • 2^ +< • • • +< Ofe • 2*= +< (ao + ai + • • • + Ojfc - 1) • 2, 
J = bi ■ 2^ +< ■ ■ ■ +<be ■ 2^ +< {bo + bi + ■ ■ ■ + be - 1) ■ 2. 
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Corollary 37. Let G E Bi. (For the results on efficient pps-definability assume G E B^.) 

(i) If G(0) > G{1) and G{1) ^ then Bi C (OR, G, D^, 
andB\ C (OR,G,^)^,p. 

(ii) IfG{0) < G{1) and G{0) ^ then Bi C (NAND,G, D^, 
andBl C (NAND, G, 

Proof. We prove (i), as (ii) is quite similar. Let if be a function in Bi. If -ff(O) < H{1) then. 



by PropositionpM H G (OR, \)^ (or H e (OR, Assume H{{}) > H{1) 



that > Sliy- Let ii' = H/G^ Then H' G B"^" so, by Proposition 



. There is k such 
H' G (OR, l)^. 
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Hence H G (OR, G, Also, if G, ii G then H' G -B^P'^ so ii' G (OR, Hence 
G (OR,G,|)^,p. ' □ 

Corollary 38. B C (OR, NAND, and -B^ C (OR, NAND, 

Proo/. Let F G -B^. Let [/(x) = ^ NAND(x,?/). Then f/(0) = 2 and f/(l) = 1 and 



U G (OR, NAND, |)^,p. By Corollary 37| we have B^ C (OR, [/, ^)^,p, and by Lemma |l5 
F G (OR, So i^ G (OR, NAND, \jZ,p by Lemma |4| ' □ 
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